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Abstract .  Consider  a  q-variate  weakly  stationary  stochastic  process  { }  with 

the  spectral  density  W.  The  problem  of  autoregressive  representation  of  T Xn > 

or  equivalently  the  autoregressive  representation  of  the  linear  least  squares 

predictor  of  X  based  on  the  infinite  past  is  studied.  It  is  shown  that  for 
n 

every  W  in  a  large  class  of  densities,  the  corresponding  process  has  a  mean 
convergent  autoregressive  representation.  This  class  includes  as  special  sub¬ 
classes,  the  densities  studied  by  Masani  (i960)  and  the  author  (1984).  As  a 

consequence  it  is  shown  that  the  condition  W_1  e.  L1  or  minimality  of  {X  1 

q*q  n 

is  dispensable  for  this  problem.  When  W  is  not  in  this  class  or  when  W  has 

zeros  of  order  2  or  more,  it  is  shown  that  in  this  case  (X  1  has  a  mean  Abel 

n 

summable  or  mean  compounded  Ceslro  summable  autoregressive  representation. 
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1.  Introduction. 

/ 

>  While  it  is  well-known  that  every  purely  nondeterministic  full  rank  q- 

variate  weakly  stationary  stochastic  process  0^}  with  the  spectral 

density  W  has  an  (infinite  order)  one-sided  moving  average  representation,  not 

every  such  process  can  have  a  mean  convergent  (infinite  order)  autoregressive 

representation  (ARR)  and  the  problem  of  ARR  of  such  processes  has  not  received 

the  attention  which  it  deserves.  Due  to  the  importance  of  ARR  in  prediction 

theory,  and  particularly  in  the  statistical  theory  of  mul ti variate  time  series, 

this  paper  is  devoted  to  the  problem  of  finding  the  weakest  condition  on  W 

which  guarantees  the  existence  of  an  infinite  order  ARR  for  {X  }. 

/" 

To  be  more  precise,  we  say  that  the  SSPtX^}  has  a  mean  convergent  (sumin- 
able)  ARR  if  there  exists  a  sequence  {A^}*  of  constant  q  *  q  matrices 
such  that,  for  each  n, 

(1-1>  Xn  ‘  1  V„-k  *  V 

k=l 

where  (e  }  is  the  innovation  process  of  {X  }  and  the  infinite  series 
«,  n  n 

£  A.  X  .  is  to  be  convergent  (summable)  in  the  mean.  This  representation  of 
k=l  K  n'K 

the  process  { Xn >  as  an  infinite  order  stochastic  difference  equation  can  also 
be  regarded  as  the  inversion  of  the  one-sided  moving  average  respresentation  of 
(Xn>  given  in  (2.6).  Such  inversion  of  the  one-sided  moving  average  representa¬ 
tion  of  a  q-variate  SSP  plays  a  vital  role  in  the  statistical  estimation  of  the 

parameters  of  (X  }.  For  the  notation  and  definitions  see  Section  2. 
n 

It  is  obvious  that  the  problem  of  ARR  {X^},  cf.  (1.1),  is  equivalent  to 

A 

the  problem  of  ARR  of  X  i  .  (the  linear  least  squares  predictor  of  X  based 


-3- 


which  has  been  studied  by  Wiener  and  Masani  [16],  and  Masani  [7], 

It  follows  from  the  isomorphism  between  the  time  and  spectral  domains  of 

(X^ }  that  the  infinite  series  (1.1)  or  11.2)  is  mean  convergent  (summable), 

2 

if  and  only  if  the  isomorph  of  in  L  (W)  has  a  convergent  (summable) 

.  .  .  O 

Fourier  series  in  the  norm  of  L  (W) .  For  a  purely  nondeterministic  full  rank 
SSP  (X^ }  with  the  spectral  density  W  =  and  G  the  one-step  ahead  predic¬ 
tion  error  matrix,  it  is  well-known  [6,  II,  p .  1 15 ]  that  the  function  G*5  e1n0 

2 

in  L  ( w)  is  the  isomorph  of  en  in  M(X).  Thus,  the  series  in  (1.1)  is  mean 
convergent  (summable),  if  and  only  if  the  Fourier  series  of  ‘J’-1  is  convergent 

(summable)  to  *  in  the  norm  of  L  (W).  Also,  it  can  be  shown  that  the 

(^l)-step  ahead  (v  >  0)  linear  least  squares  predictor  ^n+v |n_i  based  on 
(X  ;  has  3  mean  conver9ent  (summable)  ARR,  if  and  only  if  the  Fourier 

_1  _i  2 

series  of  4>  is  convergent  (summable)  to  o  in  the  norm  of  L  (W).  (This 

latter  assertion  for  the  univariate  processes  is  proved  by  Miamee  and  Salehi  [11] 

in  the  spectral  domain  and  by  Bloomfield  [3]  in  the  time  domain.) 

From  the  previous  discussion  the  convergence  (summabil ity)  of  the  Fourier 
-1  -1  2 

series  of  <t>  to  in  the  norm  of  L  (W)  emerges  as  the  only  spectral 
necessary  and  sufficient  condition  for  the  existence  of  a  mean  convergent  (summ¬ 
able)  ARR  of  (X^}.  Although  this  condition  is  not  concrete  in  terms  of  W,  it 
is  extremely  useful  in  obtaining  some  useful  concrete  sufficient  conditions  in 

terms  of  W  for  the  ARR  of  {X  }.  These  conditions  are  stated  and  proved  in 

n 

Section  3  by  using  some  techniques  from  harmonic  analysis.  In  the  following 

we  review  and  discuss  the  implication  of  these  conditions  for  the  problem  of 

ARR  of  {X  }. 

n 


In  1958,  it  was  shown  by  Wiener  and  Hasani  [16]  that  the  boundedness  con¬ 


dition  c  I  s  H  s  d  I,  where  0  <  c  <  d  <  ®,  is  sufficient  for  the  existence 
of  a  mean  convergent  ARR  of  {X^}.  Later,  Masani  [7,  Theorem  5.2]  weakened  this 
severe  boundedness  condition  considerably  and  replaced  it  by 


(1.3) 


W  e  L 


qxq* 


e  L 


1 

qxq- 


It  was  pointed  out  by  Masani  [7,  p.  143]  that  the  condition  W  c  |_“  in  (1.3) 
is  unduly  strong  and  it  would  be  worth  while  to  relax  it.  (It  was  also  conjec¬ 
tured  by  Feldman  [4]  that  the  condition  U  e  L*  is  dispensable.) 

qxq 

oo 

In  [13]  the  author  has  shown  that,  indeed,  the  restriction  W  e  L  is 
dispensable.  This  is  done  by  employing  the  equivalence  between  the  convergence 
of  Fourier  series  of  all  functions  in  L^(W)  and  the  positivity  of  the  angle  o 
between  the  "past  and  present"  subspace  and  the  "future"  subspace  of  IX  }  cf. 
Theorem  3.1.  Thus,  we  have  from  Theorem  3.1  that  (Xn)  has  a  mean  convergent 
ARR  if 

(1.4)  G  >  0  (  or  p(W)  <  1). 


For  q  =  1,  i.e.  a  scalar  density  W,  it  is  well-known  [b]  that  p(W)  •  1, 
if  and  only  if  W  =  exp(u  +  v) ,  where  u  and  v  are  bounded  real -valued  func¬ 
tions  with  ||  v ||  ^  <  ir/2.  V  denotes  the  harmonic  conjugate  of  V.  Trom  this 

characterization  of  scalar  densities  satisfying  (1.4)  it  follows  that  a  matrical 
density  satisfying  (1.4)  is  not  necessarily  bounded.  For  some  partial  character 
izations  of  matrix-valued  densities  satisfying  (1.4)  see  [2,  13]. 

It  is  shown  in  [13]  that  (1.3)  and  (1.4)  offer  twc  independent  sufficient 

conditions  forthe  existence  of  mean  convergent  ARR  of  {Xn>.  Furthermore,  condi 
tions  (1.3)  and  (1.4)  both  entail  that  w'1  e  L*  ,  i.e.  they  require  that  the 

qxq 

process  {X^ >  to  be  minimal,  cf.  [7],  which  we  feel  is  a  strong  restriction  for 


the  ARR  of  (X  }  and  it  is  desirable  to  relax  it. 
n 

Our  first  new  result  in  this  paper  shows  that  the  condition  W~*  e  L* 

qxq 

or  minimality  of  { Xn >  in  (1.3)  is  dispensable  for  the  existence  of  a  mean 
convergent  ARR  of  {X^}.  Also  this  result  gives  a  more  general  sufficient 
condition  for  the  mean  convergent  ARR  of  {X^}  which  includes  both  (1.3)  and 
U .4)  as  special  cases. 

To  state  this  result  and  for  later  use  we  denote  the  class  of  densities 
satisfying  (1.3)  by  M,  those  satisfying  (1.4)  by  A  and  define  a  new  class 
A  ®  M  by  A  <g)  M  =  {W;W  =  W**  e  A  and  e  HI,  where  denotes 

the  positive  square  root  of  .  Note  that  by  choosing  =  I  (W?  =  I)  we  see 
that  this  new  class  has  M(A)  as  its  proper  subset.  Now,  we  can  state  our  first 
theorem  which  is  an  immediate  consequence  of  Theorem  3.2.  We  note  that  this 
theorem  is  a  multivariate  extension  of  a  similar  (univariate)  result  of  Bloomfield 
[3]  and  that  the  factorization  W  *  used  by  Bloomfield  is  not  suitable  in 

the  multivariate  setting,  since  the  product  of  two  positive  definite  matrices  is 
not  necessarily  a  positive  definite  matrix. 

I heorem  1.1.  Let  { Xn }  be  a  purely  nondetermini  Stic  full  rank  SSP  with  the 
spectral  density  matrix  W.  If  W  e  A  ®  M,  then  {Xn }  has  a  mean  convergent 
ARR. 

It  is  easy  to  check  that  a  W  in  A®M  does  not  necessarily  have  the 
property  W_1  e  L*xq.  (As  an  example  when  q  =  1,  one  can  take  W  =  |l  -  e1'°jX  , 

1  a  2.)  But,  for  Wf.  A  ®  M ,  W  is  necessarily  in  L*2  .  Note  that  the 

qxq 

i  i  0  2 

scalar  density  W  =  |1  -  e  |  which  corresponds  to  the  univariate  SSP 

Xn  =  'n  "  (  n-1  does  not  be1on9  to  the  univariate  version  of  A®M.  Thus, 

Theorem  1.1  does  not  provide  any  information  concerning  the  existence  of  a  mean 

convergent  ARR  for  this  process.  However,  this  process  (X  }  can  not  have  a 

n 

mean  convergent  ARR,  since  in  this  case  the  infinite  series  in  (.1.1),  i.e. 


-  )  X  .  does  not  converge  in  the  mean.  This  example  shows  that  processes 
k=l  n'k 

{X  }  for  which  U  is  not  in  A©M  (or  in  other  words  if  W  has  zeros  of  order 

n 

2  or  more),  can  not  have  mean  convergent  AKR.  in  view  of  this  it  is  natural  to 
ask  whether  such  processes  can  have  an  ARR  with  a  weaker  requirement  of  conver- 

oo 

gence,  say  summability,  for  the  infinite  series  £  A  X  .  in  (1.1).  The 

k=l  K  n"K 

next  theorem  shows  that  this  is  actually  possible  for  a  large  class  of  processes 
when  we  replace  the  mean  convergence  of  the  series  (1.1)  by  its  mean  Abel  summa¬ 
bility,  and  compounded  Cescfro  summability. 

We  say  that  {X  }  has  a  mean  Abel  summable  ARR  if  the  infinite  series 
"  -  k 

in  (1.1)  is  Abel  summable  in  the  mean,  i.e.  lim  £  r  A  X  exists  in  the 

r+rk=l  *  n"K 

mean  for  each  n.  The  next  theorem  is  immediate  from  Theorem  3.4(c). 

Theorem  1.2.  Let  {X^ }  be  a  purely  nondeterministic  full  rank  SSP  with  the 

spectral  density  W.  Let  P  be  a  complex -valued  trigonometric  polynomial  of 

2 

some  degree  n  and  VI1  e  A©M.  If  Vi  =  )P|  W',  then  {X^ }  has  a  mean  Abel  summ¬ 
able  ARR. 


From  practical  point  of  view  (specially  for  the  purpose  of  prediction) 


the  mean  Abel  summable  ARR  of  {X  i  is  not  particularly  useful  for  the  following 
^  n  ^ 

*  °°  * 

k  a  v  ...  L.  ..  ... ..  4  .1  u  . .  V  .  ^ 


reason:  11m  Y  rK  A.  X  .  can  be  approximated  by  Y  r  *  A.  X  .  ,  where  rn 
r-1  k‘l  k  n'k  k=l  0  k  "‘k  0 


is  near  one.  Then,  since  only  a  finite  segment  of  the  past  is  available,  one 

r  k  N  k 

has  to  further  approximate  the  infinite  sum  l  rQ  Afc  Xnk  by  £  rfl  Ak  Xn_k, 

k — 1  k — 1 


where  N  is  the  number  of  available  observations  from  the  past.  This  introduces 

A 

two  sources  of  error  in  computing,  say,  X  i  ..  in  view  of  this  it  is  desirable 

n|n-i 

to  replace  the  Abel  summability  of  the  series  in  (1.1)  by  another  method  of 
summability  which  reduces  the  two  sources  of  error  to  just  one.  For  this  it 
seems  that  the  Cesiro,  or  compounded  Cestfro  summability  is  more  appropriate. 

In  the  following  we  provide  a  different  reason  (motivated  by  a  computational 
problem  in  prediction  theory)  for  the  feasibility  of  the  compounded  Cestfro 


summability  method  in  ARR  of  (X  }. 

n 

Since  $  ,  .  e  Mn’*(x),  it  is  well-known  that 

n|n-l  -® 


( 1  -5  J 


1  im 
N-xo 


E  ANk  Xn-k’ 
k=l 


where  the  A.,  's  are  some  q  *  q  constant  matrices  to  be  determined.  For  a 
fixed  N,  the  matrices  ANk,  1  ^  k  ^  N,  can  be  found  by  inverting  an  (N+l)q*(N+l)q 
matrix.  However,  as  N  increases  this  procedure  requires  inverting  larger  matrices 
and  thus  becomes  less  feasible  computationally.  This  is  caused  mainly  by  the  depen¬ 
dence  of  ANfc  on  N.  Thus  to  overcome  this  computational  problem  it  is  desirable  to 
find  a  condition  on  W  which  guarantees  that  either  the  A^’s  in  (1.5)  can  be 

replaced  by  A  '$  or  by  a  .  A  ,  where  a  is  a  scalar  depending  only  on  k 
k  N,K  K  s  Nk 

and  N  but  not  on  W.  Obviously,  if  the  former  case  prevails  then  (1.5)  reduces 
to  (1.2)  or  (1.1),  and  if  the  latter  case  prevails  we  have 


(1.6) 


X 


n 


1  im 

N-xo 


N 

l  a,„.  A,  X  .  +  e  . 
Nk  k  n-k  n 


In  either  case  we  have  Ak  =  k  =  1,2,...,  where  the  D^'s  are  the  Fourier 

(Taylor)  coefficients  of  ♦"*,  cf.  [9]. 

a> 

Let  De  a  fixed  increasing  sequence  of  positive  numbers  such  that 

‘‘n 

1  im  -jjp  =  0.  We  say  that  (X  }  has  a  mean  compounded  Cesa'ro  summable  ARR  (corre- 

N ""  n 

sponding  to  («.,>)  if  (1.6)  holds  with 
N 


N  -  1,2,...,  k  -  1,2,...,N.  For  more  information  on  the  compounded  Cesaro  summa¬ 


bility  and  the  o  's  see  Section  3,  Theorem  3.5  and  the  discussion  preceeding  it. 


Now,  the  proof  of  the  next  theorem  is  immediate  from  Theorem  3.5(b). 

Theorem  1.3.  Let  (X  }  be  a  purely  nondeterministic  full  rank  SSP  with  the 

n 

spectral  density  W.  Let  P  be  a  complex-valued  trigonometric  polynomial  of 

some  degree  n  and  W  e  A  ®  M.  If  W  =  |Pl2W,  then  the  process  (X  }  has 

n 

a  mean  compounded  Cescfro  summable  ARR. 

At  the  end,  we  would  like  to  note  that  Theorems  3.4  and  3.5  show  the 
existence  of  a  mean  Abel  (compounded  Cesaro)  summable  ARR  of  {X^}  only  when  the 
spectral  density  matrix  has  a  finite  number  of  zeros  (of  any  finite  orders)  on 
[-TT,irJ .  As  yet,  we  do  not  have  any  in'jrmation  on  the  ARR  of  a  process  whose  spec¬ 
tral  density  has  a  zero  of  infinite  order.  For  q  =  1,  W ( e )  =  exp{ - 1 0 1 -x } , 

0  <  \  <  1,  provides  a  family  of  such  densities. 


2.  Notation  and  Preliminaries. 

-  q 

Throughout  this  note  for  a  q  *  q  matrix  A  =  (a..),  trA  =  ]  a.., 

II  1=1  11 

A*  =  (a..),  det  A  stands  for  the  determinant  of  A  and  A  for  the  inverse 

J  1 

of  A  whenever  it  exists.  For  two  q  x  q  matrices  A  and  B,  A  >  B  means  that 
A  -  B  >  0,  i.e.  A  -  B  is  a  nonnegative  definite  matrix.  Functions  will  be 

defined  on  [-it, it]  and  we  identify  this  interval  with  the  unit  circle  in  the 

complex  plane  in  the  natural  way.  Typical  value  of  a  function  defined  on  [-n,it] 


or  on  the  unit  circle  will  be  denoted  by  f(e)  or 


stands  for 


do  71 

and  dm  for  the  normalized  Lebesgue  measure  on  [-tt.tt],  i.e.  dm(o)  =  — -  .  For 

c  TT 

0  <  P  <  o n,  LP(HP)  denotes  the  usual  Lebesgue  (Hardy)  space  of  functions  on  the 
unit  circle,  Lp  (Hp  )  denotes  the  space  of  all  q  *  q  matrix-valued  functions 

qxq  qxq'  r  n  ^ 

whose  entries  are  in  LP(HP). 

2 

Let  (ft,B,P)  be  a  probability  space  and  M  =  Lg(ft,B,p)  the  Hilbert 
space  of  all  complex-valued  random  variables  on  ft  with  zero  expectation  and 
finite  variance.  The  inner  product  in  M  is  given  by  (x,y)  =  Exy,  x,  yrM.  In 
the  following,  we  introduce  a  few  concepts  which  are  needed  in  this  study,  for 


-y- 

mo re  discussion  with  proofs  see  [9,  16]. 

For  an  integer  q,  1  <  q  <  Mq  denotes  the  cartesian  product  of  M 

with  itself  q  times,  i.e.  the  set  of  all  column  vectors  X  =  (Xj.x^,. . . ,x  ) 

w.th  x  e  M,  i  =  l  ,2, . . .  ,q.  f1q  is  endowed  with  a  Gramian  structure.  For 

X  and  Y  in  Mq  their  Gramian  is  defined  to  he  the  q  *  q  matrix  (X,Y)  = 

[(x.,yj)].  It  is  known  that  Mq  is  a  Hilbert  space  under  the  inner  product 

( ( X ,Y) )  =  t.r(X,Y)  and  norm  ||  X  j j  =  /( (X ,X) )  provided  the  linear  combinations 

are  formed  with  constant  q  *  q  matrices  as  coefficients. 

Let  (X  ;  n  e  7L)  <=  Mq.  {x  ;  n  e  2}  is  said  to  be  a  q-variate  stationary 
n  n 

stochastic  process  (SSP)  if  the  Gram  matrix  depends  only  on  m-n,  for 

all  integers  m  and  n.  Such  a  process  has  a  spectral  representation,  i.e.  for 
all  integers  n 

(2.1)  X  -  [  e'in0dZ(e), 

n  J 

where  Z(.)  is  a  countably  additive  orthogonally  scattered  M^-valued  measure 

[9]. 

ihe  q  x  q  >. ^negative  hermitian  matrix-valued  measure  F(.)  =  (Z(.),  Z(.)) 

is  called  the  spectral  measure  of  {X  }.  In  case  F  <  <  m,  we  say  that  (X  } 

n  n 

has  the  spectral  density  W  =  dF/dm  on  [-n,ir].  It  can  be  shown  that  W(.) 
is  a  nonnegative  hermitian  matrix-valued  function  and  for  all  integers  m,n, 

(2.2)  (VV  =  {  e‘l(nw,)0W(e)dm(e). 

For  each  SSP  (x  },  let  Mm(X)  =  Tp  (X.  ;  n  s  k  <  m},  -»  <  n  <  m  <  «■> 

n  n  k 

where  sp{  }  stands  for  the  closed  linear  span  of  elements  of  {  }  in  the 

norm  of  Mq.  The  time  domain  of  (X^)  denoted  by  M(X)  is  defined  by  M(X)  = 

M  (X;.  The  spectral  domain  corresponding  to  the  spectral  density  matrix  W 


-10- 


is  denoted  by  l  (W)  and  is  defined  by 


2  ? 
L  (H)  s  {»;  f  a  q  x  q  matrix  valued  function  with  II  y  II 

w 


tr4'*(e)W(e)f{e)dm(0)  <  «>}. 


It  is  known  that  L  (W)  with  inner  product  given  by 


((<t>, s,))w  =  tr  4>*(6)W(e)'il(0)dm(0) 


is  a  Hilbert  space  and,  [9,  Theorem  7.3],  that  the  correspondence 


(2.3) 


T:4< 


H*  (fl  )d  z(  0) 


is  an  isometric  isomorphism  on  L  (W)  onto  M(X).  T  is  called  the  Kolmogorov 
isomorphism  between  the  spectral  and  time  domains.  It  follows  from  {?.?)  and 
(2.3)  that 


(2.4) 


(  l  Vr  l  Vk>  *  <  l  N1 

jej  J  J  1 


keK 


where  J  and  K  are  finite  sets  of  integers  and  A.,  B.  are  q  *  q  constant 

J  <3 


matrices.  An  important  problem  in  the  prediction  theory  of  an  SSP  {x  }  is  to 


find  a  (meaningful)  series  representation  in  the  time  domain  which  corresponds 

2 , 


to  a  given  v  in  the  spectral  domain  L  (w).  The  identity  (2.4)  shows  that  this 
can  be  done  very  easily  when  4*  is  a  polynomial  of  finite  degree.  However,  this 
problem  is  very  complicated  when  ¥  is  not  of  this  form. 


The  prediction  problem  of  an  SSP  (Xn }  with  the  spectral  density  W 


can  be  stated  as  determining  the  matrices  Ak  (in  terms  of  W),  0  k  ■  ...  such 


that  for  a  fixed  v  >  1,  the  linear  least  squares  predictor  of  X  ,  i.e. 

n  +  v 


sV- 


:> 


♦  ivi 


t'. 


r.  . 


•v 

r,  .- 

<-.V 


f. ■/  s'  •.'<**•*  *•"  s'  •;  s'  s*  *  ■  •  ”  .  ■  .  ■  r  .*  »-  \ 
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X  ,  =  { X  | M  ( X) )  can  be  written  as 

ntv  n+v'  -°°  ' 


(2.5) 


A  N 

X  =lim  [  A.  X  =  I  A,  X  .  , 
n+v  M-x»  k=0  k  n‘k  k=0  k  n"k 


where  for  a  vector  X  £  Mq  and  a  subspace  S  of  Mq,  (X  |  S)  denotes  the  ortho¬ 
gonal  projection  of  X  on  the  subspace  S  of  Mq. 

Let  c  =  X  -  (X  |Mn"*(X)),  n  e  n  .  The  stochastic  process  {e  ;  n  e  TL) 

n  n  n  -°°  n 

is  called  the  innovation  process  of  {X  }.  It  is  known  that  (e  ,  e  )  =  6_  G, 
-  n  m  n  mn 

where  G  =  (eq,  Eq)  is  the  prediction  error  matrix  for  lag  1.  The  SSP  { Xn >  is 

said  to  be  of  full  rank  if  the  matrix  G  is  nonsingular.  Throughout  this  note 

we  assume  that  f X^ }  is  a  purely  nondeterministic  full  rank  process.  This  is 

equivalent  to  assuming  that  ( >  has  a  spectral  density  W  such  that  log 

iog  det  W  r  L  and  W  =  <H*,  where  i  e  H  is  an  outer  function  with  <t> (0)  = 

qxq 

1 

G  Y  We  refer  to  !>  as  the  generating  function  of  the  process  {X^l.  Since  { } 

has  full  rank,  we  can  define  a  process  (Y  },  called  the  normalized  innovation 

n 

process  of  {X  } ,  by  Y  =  G_i*e  .  Then  (Y  ,Y  )  =  6  I.  By  using  the  Wold's 
n  n  n  m  n  mn 

decomposition  [9]  we  get  an  infinite  order  one-sided  moving  average  representation 

for  {X  1: 

n 


(2.6) 


x  =  y  c.y 

n  k  n-k 


♦  (e)  -  l  c  e,ke.  <c  =  gY 

k=U  *  ' 


As  a  measure  of  the  angle  between  the  past-present  and  future  subspaces 
of  the  process  { }  we  take  its  "cosine"  defined  by 

p(W)  =  sup(  |  ( (Y,Z)  )|:  Y  e  M°00(X),  Z  e  M*(X) 


and  ||  Y ||  s  1,  ||  Z ||  s  1). 
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It  is  clear  that  0  <  p(W)  <  1.  The  past-present  and  future  are  said  to  he  at 
positive  angle  if  p(W)  <  1. 


-1  7 

3.  Mean  Convergence  of  the  Fourier  Series  of  »  in  L  (W). 

As  noted  in  Section  1,  mean  convergence  of  the  Fourier  series  of  * 


-1 


in 


L  (W)  emerges  as  the  only  necessary  and  sufficient  condition  for  the  autoregres¬ 
sive  representation  of  a  purely  nondeterministic  full  rank  process  with 

the  spectral  density  W.  This  section  is  devoted  to  finding  useful  sufficient 

conditions  on  W  which  guarantee  the  mean  convergence  of  the  Fourier  series  of 

-1  2 
*  and  many  more  functions  in  L  (W). 

We  would  like  to  note  that  for  a  general  density  W,  <t>-1  is  not 

necessarily  in  L*  and  therefore  the  Fourier  coefficients  of  are  not 

well-defined.  For  the  next  theorem  we  need  the  natural  assumption  on  W,  that 

2  1 

is  W  is  such  that  L  (H)  c  l  .  Under  this  assumption  the  Fourier  coefficients 

Qxm 

of  every  function  in  L  (W)  is  well-defined.  It  can  be  shown  [10,13,14]  tnat 
L2(W)  c  LL  if  W  1  e  lL.  and  °nly  (det  W,-1'*  e  L*.  Thus  this  assump- 

q  xq  q  xq 

tion  is  weaker  than  p(K)  <  1. 

The  next  theorem  which  is  a  multivariate  extension  of  a  deep  theorem  of 

Helson  and  Szego  [6,  p.  131]  provides  a  necessary  and  sufficient  condition  for 

2 

the  mean  convergence  of  the  Fourier  series  of  every  function  in  L  (w) ,  c.f.  [10, 
13,  14]. 


Theorem  3.1.  Let  W  be  a  q  *  q  matrix-valued  density  function.  Then  p(W)  <  1, 

2  1  2 
if  and  only  if  L  (W)  c  l  and  the  Fourier  series  of  any  v  in  L  (W)  converges 

to  V  in  the  norm  of  l^(W). 

Next  we  find  a  weaker  sufficient  condition  on  W  which  guarantees  the 
convergence  of  the  Fourier  series  of  every  function  in  a  smal I  subclass  $  of 
L2(W).  For  a  density  W,  we  define  S  ■  {v  e  L2(W);  TWf*  t.  1.  Note  that 
this  class  has  ♦"*  as  one  of  its  elements.  In  the  following  theorem  we  show 


that  if  W  c  A®M,  then  the  Fourier  series  of  every  ¥  in  S  converges  to  ¥ 
2 

in  the  norm  of  L  (W) .  This  theorem  is  a  matricial  extension  of  a  similar  uni¬ 
variate  result  due  to  Bloomfield  [3]. 

Theorem  3.2.  Let  W  be  a  q  *  q  matrix-valued  density  function  and  S  = 

It  r  L^(W);W*  e  L  }.  If  W  e  A©?',  then  the  Fourier  series  of  every 
q  *q 

•> 

function  ¥  e  S  converges  to  ¥  in  the  norm  of  L~(W). 

Proof.  Let  toj  and  w  denote  the  smallest  and  largest  eigenvalue  of  W^. 
Then  for  W  e  A®M  we  have 


13.1) 


«o.W,  <  W  <  <u  W, 

11  q  1 


a.e. (Leb.) , 


and  for  ¥  e  S  we  have 


jtr¥  M1»*dm  <  w"1  tr¥  W¥*dm  <  ||  tr¥  W¥*||  w  jw"1  dm  <  », 


2 

which  proves  that  ¥  e  S  implies  ¥  e  L  (Wj). 

Since  ¥  e  L  (Wj)  and  p(Wj)  <  1,  it  follows  from  Theorem  3.1  (and  the 
argument  proceeding  it)  that  the  Fourier  series  of  every  ¥  e  S  converges  to 
t  in  the  norm  of  L^(Wj),  i.e.  with  denoting  the  symmetric  n-th  partial 
sum  of  the  Fourier  series  of  ¥  we  have 


which  follows  from  (3.1). 


Q.E.D. 
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As  mentioned  in  Section  1,  a  density  in  A©H  can  not  have  a  zero  of 
order  2  or  more.  Thus  Theorem  3.2  does  not  provide  any  information  in  regard  to 
autoregressive  representation  of  such  processes.  Also  it  was  noted  that  such 
processes  can  not  have  mean  convergent  autoregressive  representation.  Therefore 
it  is  natural  to  replace  the  requirement  of  mean  convergence  of  the  series  for 
the  autoregressive  representation  by  a  weaker  mode  of  convergence  for  the 
series,  say,  summability.  In  the  following,  first  we  study  the  Abel  summability 


of  the  series  involved.  For  this  we  need  some  notation. 

For  a  density  W  we  define  H^(W)  =  sp{ein6I;  n  >  0}  in  L 2 ( W ) .  Mote 

-12  -1 
that  $  e  H  (W).  Also  we  would  like  to  note  that  for  a  general  W,  <i>  and 

2  1 

other  elements  of  H  (w)  are  note  necessarily  in  L  and  therefore  their 

qxq 

Fourier  coefficients  are  not  well-defined.  By  an  argument  similar  to  that  of 

Rosenblum  [15,  p.4l]  one  can  identify  the  elements  of  H  (w)  with  analytic 

matrix-valued  functions,  and  thus  here  by  the  Fourier  coefficients  of  elements 
2 

of  H  (W)  we  actually  mean  the  Taylor  coefficients  of  these  analytic  functions. 

1  -  r2 

Let  P  {.)  denote  the  Poisson  kernel,  i.e.  P  (e)  =  —  - ,, 

r  r  l-2rcosfl+K 

2 

0  <  r  <  1  and  -it  <  0  <  tt.  For  a  function  f  e  h  (w)  with  Fourier  (Taylor) 

coefficients  (v.  }”  the  convolution  of  P  and  y  is  defined  (and  denoted) 
k  k=U  r 


(3.3) 


•V(ei6)  -  f(rei6)  =  (P*f)(e10)  =  l  Tkrk  eikG. 

r  k=0  K 


We  say  that  the  Fourier  (Taylor)  series  of  a  function  f  n  H  \W)  is 
Abel  summable  to  f  in  the  norm  of  l'(W),  if  and  only  if 


(3.4) 


-  T||y  *  0. 

r-1  r  w 


It  follows  from  the  Isomorphism  between  the  time  and  spectral  domains 


that,  the  autoregressive  representation  of  {X  }  is  mean  Abel  summable,  if  and 

n 

only  if  the  Fourier  (Taylor)  series  of  ♦  *  is  Abel  summable  to  t  ^  in  the 
norm  of  (W) . 

Next,  following  the  pattern  of  Theorems  3.1  and  3.2,  we  find  a  necessary 

and  sufficient  condition  on  W  for  the  Abel  summability  of  the  Fourier  (Taylor) 

2 

series  of  every  function  in  H  (W).  For  this  the  matrix-valued  function  Q 
defined  (in  terms  of  $)  by 


(3.5) 


Q ( o )  =  Q(r,  0,  $)  *  4>~*(rei9)$(e10) ,  -ir  <  e  <  tt  , 


plays  an  important  role.  The  following  theorem  is  actually  a  matricial  extension 

of  some  of  the  (univariate)  results  due  to  Rosenblum  [15,  Theorem  1(H)], 

Theorem  3.3.  Let  W  =  be  a  q  *  q  matrix -valued  density  function  with 

log  det  W  e  L1.  Then  the  following  statements  are  equivalent. 

2 

(a)  The  Fourier  (Taylor)  series  of  every  function  S'  in  H  (W)  is  Abel 

2 

summable  to  v  in  the  norm  of  L  (W). 

(b)  There  exists  a  constant  Kj,  0  <  <  «,  such  that  for  all  functions 

v  *  H^(w)  we  have 


(3.6) 


P/I’ll  w^i  |M|W,  0  <  r  <  1. 


(c)  Tnere  exists  a  constant  K^,  0  <  <  ”,  such  that 


(3.7)  (pr*tr  QQ  )(8)  <  K^»  0  <  r  <  1  and  -ir  <  0  <  ir. 


Proo f .  (a) — ^  (h)  follows  from  the  uniform  boundedness  principle. 

(b)  (a)  follows  from  an  argument  similar  to  that  given  in  Rosenblum 

[15,  pp.  32-33]. 


'  ".'a'  «  •  «-•  •-*  ■-  i.V.  *  •.*  >  •  •  O  «_*  .  •  .  •  ■ 


•  *  -  *i 

•  ■  i 


V  ■/  •.* 
•  .*  •• 
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To  prove  that  lb)  implies  (c),  we  note  that  for  each  0  <  r  <  1  and 

-t  <  x  s  it  the  function  f(e)  =  ¥(9,  r,  x)  =  (1  -  re1^"^)'*  <j> ’*(o)  is  in 

2  2 
H  (W).  This  is  a  consequence  of  the  closure  theorem  for  H  »  cf.  [8,  p.  PR8]. 

o  q  h 

2  2  ie  ^ 

By  using  the  simple  inequality  4 ( 1-r  ) 1 1-r  e  |  s  < e ) ,  and  applying  (3.6)  to 
this  function  V  we  get  (3.7): 

[  P  (e-x)  tr$_1  (reie)  <t>(e)  $*{e)  $*(re10)dm(e) 

*  -IT 

IT  2 

<  4(l-r2)  [  ( 1-r2  e1  ^6‘x^|  tr  $_1(re10)W(0)  ^(re1  °)dm(o) 

'  -IT 

<  4(l-r2)  Kj  |l-re^0‘x^|  ?  tr  $-1(e)W(e)  **  (rel0)dm(o) 

“IT 

"JT 

=  4q  Kj  Pr(e-x)  dm(0)  =  4q  Kj. 

—  IT 


To  prove  (c)  implies  (b)  we  note  that  for  any  »  t  H  (W)  [using  the 
Cauchy-Schwartz  inequality,  Fubini's  theorem  and  1 3.7)1  that 


||pr^llM  =  I  tr  Yire1 9)<p( e)  $*(8)  ^(re1 

^  -IT 


)dm(e) 


=  f  tr  4,(rei 6)$-(re1 9)Q(9)Q*(e)  [l»ire^0)4(re*0)]din(0) 

/  — TT 


;  I  tr  QQ*  •  tr  ^(re1 0)<p(re1 9)  [*(re,w)*(re'  r’)]'dm(o) 

-ir  q  t  2 

=  l  tr  QQ*  |  l  4»k  ( re1 0)«*  .(re1  e)  |  dm(e) 
k,«.=l  •'-it  J=1  J 

=  |  T  tr  QQ*  |  [  P  (0-x)(  ?  fk1  * -j  j ) ( x ) dm ( x )  |  dm(o) 

k,H  =  l  -’-IT  ’  -IT  r  J=1  J  J 

q 

S  J-1  l..tr  Q!)*  '  )r’/r(e-X>  ( j,  \j 

"Ft  fit 

=  tr  v  W  v*  (  P  (x-e)tr  QQ*(e)dm(e))dm(x) 

J-tf  J-irr 


2  "  "  W 


Q.E.D. 


Although  Theorem  3.3  provides  two  equivalent  necessary  and  sufficient 

conditions  for  the  Abel  summability  of  the  Fourier  (Taylor)  series  of  every 
2 

function  in  H  (w),  these  conditions  are  hard  to  apply  and  are  not  explicit  in 

terms  of  the  components  of  W.  In  the  next  theorem  by  using  Theorems  3.1,  3.2 

and  3.3  we  provide  sufficient  conditions  for  the  Abel  summability  of  the  Fourier 

2  -1 

(Taylor)  series  of  all  functions  in  H  (W)  and  in  particular  $  ,  which  are 

easy  to  apply  and  more  explicit  in  terms  of  the  components  of  W. 

Theorem  3.4.  Let  W  be  a  q  *  q  matrix- valued  density  function  with 
log  det  W  t  L*  and  P  be  a  complex-valued  trigonometric  polynomial  of  some 
degree  n . 

2 

(a)  If  w  c  A,  then  the  Fourier  (Taylor)  series  of  every  function  ¥  e  H  (w) 

2 

is  Abel  summable  to  ¥  in  the  norm  of  L  (W). 

lb)  Let  We  A.  if  W  =  |P|2W,  then  the  Fourier  (Taylor)  series  of  every 
function  ¥  e  H2{W)  is  Abel  summable  to  V  in  the  norm  of  L2(W). 

(c)  Let  S  =  {¥  e  H2(W);  L"  >  and  W  e  A  ©M.  If  H  =  |P|ZW, 

q  xq 

then  the  Fourier  (Taylor)  series  of  every  function  ¥  e  S  is  Abel  summable 

2 

to  f  in  the  norm  of  L  (W). 

Proof.  In  view  of  Theorems  3.1  and  3.3,  proofs  of  (a)  and  (b)  are  the  same 

as  the  proof  of  Lemma  6  in  [12].  (c)  follows  from  (b)  by  using  the  method  of 

proof  of  Theorem  3.2  and  replacing  Sn  by  ¥ r.  Q  E  D 

Now,  we  turn  to  the  problem  of  Cesaro  summability  of  the  Fourier 

2  2 

(Taylor)  series  of  functions  in  H  (W).  Let  ¥  e  H  (W)  with  Fourier  (Taylor) 

oo  ^  i  Ic  e 

coefficients  {¥.}.  and  partial  sums  S  (e)  =  ]>  ¥.e  .  For  a  >  0,  we 

k  u  n  k=0  K 

say  that  the  Fourier  (Taylor)  series  of  ¥  is  (C,  a)_  summable  to  ¥  in  the 
norm  of  L2(W)  i  f 


For  scalar  sequences  it  is  well-known  that  the  strength  of  (C-a) 
methods  increases  with  a.  However,  there  are  series  which  are  Abel  summable 
but  not  (C,o) -summable  for  any  a  >  0,  cf.  [5,  p.  108],  Because  of  this  and 


in  view  of  the  importance  of  relations  like  (3.8)  in  prediction  of  (X^),  cf. 
Section  1,  we  consider  the  stronger  method  of  compounded  Cestfro  summability 

co 

method:  Let  {an>0  be  a  (fixed)  monotone  increasing  sequence  of  positive  num- 

2 

bers.  We  say  that  the  Fourier  (Taylor)  series  of  f  t  H  (W)  is  compounded 
Cesdro  summable  to  ¥  if 


1  im 

n-x» 


=  0. 


It  is  known  [l]  that  the  compounded  Cestfro  summability  method  is 

a 

regular,  if  and  only  if  lim  —  =  0.  For  more  information  on  the  subject 

n-*»  n 

of  summability  and  the  definition  of  undefined  terms  the  reader  may  refer  to 
[5]. 

The  next  theorem  establishes  analogue  of  Theorem  3.4  for  the  compounded 
Cesaro  summability. 

Theorem  3.5.  Let  W  be  a  q  *  q  matrix-valued  density  function  with 

log  det  W  e  L  ,  P  be  a  complex- valued  trigonometric  polynomial  of  some  degree 

n  and  {a  }  a  (fixed)  monotone  increasing  sequence  of  positive  numbers  with 
a  n 
lim  -H  =  0. 
n-*°  n 

(a)  Let  W*  e  A.  If  W  =  |P|^W',  then  the  Fourier  (Taylor)  series  of  every 

2  2 
function  »  t  H  (W)  is  compounded  Cestfro  summable  to  ¥  in  the  norm  of  L  (W) 

lb)  Let  W'  e  A®M  and  S  be  as  in  part  (c)  of  Theorem  3.4.  If  W  =  |P|^w', 

then  the  Fourier  (Taylor)  series  of  every  ¥  e  S  is  compounded  Cestfro  summable 

2 

to  ¥  in  the  norm  of  L  (W). 


Proof.  We  prove  only  part  (a)  since  the  proof  of  part  (b)  is  the  same  as  the 


proof  of  Theorem  3.4(c). 

2 

To  prove  (a)  first  we  show  that  every  f  t  H  (H)  has  radial  limits 

a.e.  (Leb.)  Let  <j>  and  be  the  optimal  factors  of  ( P | *~  and  W',  respec- 

tively.  Then  4>  the  optimal  factor  of  W  *  |P|  W'  is  given  by  •  Since 

-Ik 

W'  t  A  it  follows  that  4>.  e  H'r  ,  and  since  <t  is  an  analytic  polynomial 

1  qxq 

•1  6 

of  finite  degree  with  no  zeros  in  the  open  unit  disc  we  have  that  4>  e  H  , 

0  *  <s  <  ?2.  Thus  r1  e  H  1  ,  ■£-  -  2  +  x  •  Now,  we  note  that  every  v  e  H2(w) 

qxq  P.  o 

i  .1  o 

has  a  representation  of  the  form  f  s  h  *  ,  where  h  e  H  »  and  this  entails 

q*q 

that  e  with  ^  =  j  +  .  Therefore,  every  ¥  e  H2(w)  belongs  to  Hqxq» 

for  some  p  >  0  ,  which  implies  that  ¥  has  radial  limits  a.e.  The  rest  of  the 
proof  follows  from  Theorem  3.4(b)  and  adopting  the  method  of  proof  of  Theorems 
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